Accession Number | TCMCG075C21791 |
gbkey | CDS |
Protein Id | XP_007021337.2 |
Location | join(4470566..4470624,4470743..4470919,4471387..4471514,4471789..4471841,4471934..4471986,4472082..4472234,4472691..4472932,4473059..4474758,4475088..4476101) |
Gene | LOC18593870 |
GeneID | 18593870 |
Organism | Theobroma cacao |
Length | 1192 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007021275.2 |
Definition | PREDICTED: uncharacterized protein LOC18593870 isoform X1 [Theobroma cacao] |
CDS: ATGGACTTCAGGACACGTCTTGATTATGCTTTGTTCCAACTCACTCCAACTAGAACCAGGTGTGATCTGGTGATTTTTGCTGGGAAAGAGACTGAGAAGTTGGCATCAGGATTGTTAGAACCGTTCATTTTACACCTCAAAAGTGCTAAAGATCAGATTTCAAAAGGAGGGTACTCTATAACTCTTCGTCCAGTAGGCTCAACTCCTTCCTGGTTCACAAAAGGCACCCTGCAAAGGTTTGTGAGGTTTGTTAGCACACCAGAGGTTCTTGAGAGATTTGTGACAGTAGAGAGAGAAATTGAACAGATTGATAATTCAATTCACTCAAATGAAGCAAATGCTGCTGGGGCAACAGAGGCAGATGGAAATGAGTCAGTTATTTCAGGAAATTTCCAAAAGTCAATTTCTTCATTTAAGTCCAAAGGTGAACTCAATGGAACTGCTGATGCTGCGCAGGAAGAAAATTCCAAGGCTCGTCTTCAACGAGTTCTGGAAACTAGGAAGAAAGTACTCTGTAAAGAACAAGCGATGGCTTATGCTCGTGCTTTGGTTGCTGGATATGAACCTGATAATATAGAAGATCTCATATCTTTTGCAGATGCTTTTGGTGCTTCACGTTTAAGGGAAGCTTGCATAAATTTCATGGACTTATGCAAGAGAAAGAATGAAGATAGGCTTTGGATGGCAGAATTAGCAGCAATGCAAGCATGTCCAAGACCAGACTTGTCTTACCTCGGAACATCAGGAATCATACTTGCTGGGGAAGAAAATGATCCTAATCAAAATCTTATGATGAATTTCTCAAGCGGGAAGCAAAATGGTTCTGCTGATGCCTCTGATGCCGGGAGTGGAGATATTAACCCAGATGGTAGCTTGCCATCTGCAGATGGTAAAGCCCAAGTACAAATGCCATGGCCACCCCATCTTCCTCAGTACATGCATAATTTTCAGGGTCCTGGATTTCAACAAATGCCTCCATATCAAGGCTACCTTTTCCCCGGTATGCATGCTGCCTCTCCATATTATCCAGGGAATATGCATTGGCCCCCAAATGTAGAGGATTCTAGTCTTGGTCGTGCTTGGGAACCAGATGATCGTAGAAATCATAAATCATCTTCTAGGAGCAAGAAGAAATCTTCACGTGGTAAGGGAGATGAAACTTCAAAGCAAGATGAATCCACTGAGCCTAGCGATTCCAGCTCTGAGAGTGAACCAGAAGAGCAGGTGCATAAGAAAAAGCATGGAAAGAAATCCTCAAGAAAGGTTGTCATCCGTAACATTAATTACATTTCTTCCAAGAGGAATGGGGAAAAGGGCAGTGATTCTGAAGAGATTTCTGATGAGGATGAGTTCATTGATGGAGATTCTCTCAAACAGCAAGTAGAGGAGGCTGTTGGATCACTGGGGAGACATCATAAATCTACTTCACGTCATCATAAGAAACACGATGGAAGCAAGCATCGAAACACTGTTTCATATGATGAAGAAGAACAGGAAGCTAAGGCTTCTAACGCAAAAAATCCTGAGGGAGAAAAAAGAAACAACCCCTGGGATGCTTTCCAGAACCTTCTGTTGCAGGACAAGGATTTGGATTCCTCAGAAGTAGATCCACAACCAATAAGGTTGCAAGAGGAATATTTTGCAAGCAAGGGCTCTGAGGACGGAAGGTCATCAGCATTTAACCCGAACTCTGAGCGAGCAGCAAAGCAAAAATCAATGTCAAGTGATCCATTTCTGGCCACACAGATGGATAGGGGTCATGAAGGTGACACTCGAGGTAGAAATTTTGGAACTAATGAATTTGGTGGCTCGGTTTTTAAGAGAAGAGAGAGCACAAATGAGGAGTTGTTAATTCTGCAAGGAAATGATTCTGGGATTAATTCACATGCTTTTATCTCTGATTATGCCGCAGAGTCTACTATGATCAAAAGTCGCAAAGAAGGAGAATGGTTTATCAACAACCAACTGGATAAATCAGCAAATCAGGATGAGATC
ATGGGCCTCAAAATGTTTGATGGGGATCATGCTTCTTCATTAGCTCGTGACCGTTTCAACACTGAGACAAACAAGAATGATGTTTTCGTTGATGACTCTTTCATGATTCAGGGTCCATCAGTGGGAGATGATCAATCTGATTCTCAGTTACGGATAGGTATAGGCATGGTTCCAGAAATTGAAGGTGCTCAATACGAAAATGGCAATTCAGAAAATGTACAGAAGGCTGCTTCTGTTTCCTACGAGCCAGATGACCTTTACATGGTGCTTGGGCGTGATTCAGCTGAGGAAAATGCCATGACTTCTTGGACTCCAGAAATCGACTATGAAATGAACGTATTATCTGCTGAAGCGAATGGAAGACACTCTGATGTTGAAACAACTGGTGCTGATGACAAGGGTGCTAATGGTAAAAACCGTGGAAGTTCTGAGCGTAAACTTTCAAATAAAGAAGTCCGTTCTAGAGTTCCAAATGGATCTCTTGTCAAGAGCAAATCAGACATAGCAGCAAAGACCAGGAAACCTCCAGCTGGAAGCAGAACCACAGTACGGAAAACTAAATTTGATCAGGAAGAGGAGAATCGAAAGAAAATAGAGGAGTTACGGATTCAGCGCCAGAAGAGGATTGCTAAGAGGAGTGTTGCTAGTGGTGCTAATCCAGTTACTTCCAGGAGGAGCTCCACAGAAAATAAAACTTCAACGATTTCCATGAAGAGTCAACCTTTGACTCAGGACACTAAGAAATCACCAAAGCCAGTCCTTAGAAGTTCCACTATAGAACGCCTTGCAACTGCAAGGAATACTTCAAAGGCCTCATCAGCTGAATCAAAAGCCAGCCAGCCCAAAAAGTCAACCTTGAAGGAAAATGGTTCTTCAACAACAGTATCTCAGAAGACTGCTCCTGTTGAAGATAAGAAATCAAGCTCAAACAAAGTCAGAGCTTCAGATAAGAAAAGTGGCCCAAACAAAGTACTTTCCAGTGACTCTGTTGCACAAGGAAAGGACTCCAAAGAGGTCACAGTAGCATTGCCAACGGAGCCAGCAGCACCCAGAGAAACTCAACCTACTGACATTGTTGATAATTTCAAAGACATTCAGGAGTTGCAGAGTACTTCAATAGAAAAAACTGAAGAAAAGGAAATTTCTCAAAGAAACACATCAGAGGACAGAAGCTCCAATGGGAATATGCTTACTGAAGATAAGCCAGTGCAATTAGATCATGTAAAAGGTGACGAGGAATTGACTAAGGCATCTACTGTTGTTTCTGAGGACAAAAGAGCACCAGAAGATTTTGTTGAAGATATTCCTGAGATGACAGTTCATCCGTTGCCACCACTGCCTGTAAAGACTGTTAAGTTTGCCACGGTAAATATAGAAGGGAATGGTGGAATGAATGAAAAGTTTCTGTCACCTAGGATTTCTGAAATAGAGATCTCAACTCCGCCGCCAAATGATGGAATGAACACAGAACCAGTGCACTCCAGGAAGAAATGGAACAATGATGAAACCTCTCCTAAGGCAGCCAAAGGTTTTAGAAAGCTCCTTTTCTTTGGACGAAAAAACCGAAACTCTCCTACTTACTGA |
Protein: MDFRTRLDYALFQLTPTRTRCDLVIFAGKETEKLASGLLEPFILHLKSAKDQISKGGYSITLRPVGSTPSWFTKGTLQRFVRFVSTPEVLERFVTVEREIEQIDNSIHSNEANAAGATEADGNESVISGNFQKSISSFKSKGELNGTADAAQEENSKARLQRVLETRKKVLCKEQAMAYARALVAGYEPDNIEDLISFADAFGASRLREACINFMDLCKRKNEDRLWMAELAAMQACPRPDLSYLGTSGIILAGEENDPNQNLMMNFSSGKQNGSADASDAGSGDINPDGSLPSADGKAQVQMPWPPHLPQYMHNFQGPGFQQMPPYQGYLFPGMHAASPYYPGNMHWPPNVEDSSLGRAWEPDDRRNHKSSSRSKKKSSRGKGDETSKQDESTEPSDSSSESEPEEQVHKKKHGKKSSRKVVIRNINYISSKRNGEKGSDSEEISDEDEFIDGDSLKQQVEEAVGSLGRHHKSTSRHHKKHDGSKHRNTVSYDEEEQEAKASNAKNPEGEKRNNPWDAFQNLLLQDKDLDSSEVDPQPIRLQEEYFASKGSEDGRSSAFNPNSERAAKQKSMSSDPFLATQMDRGHEGDTRGRNFGTNEFGGSVFKRRESTNEELLILQGNDSGINSHAFISDYAAESTMIKSRKEGEWFINNQLDKSANQDEIMGLKMFDGDHASSLARDRFNTETNKNDVFVDDSFMIQGPSVGDDQSDSQLRIGIGMVPEIEGAQYENGNSENVQKAASVSYEPDDLYMVLGRDSAEENAMTSWTPEIDYEMNVLSAEANGRHSDVETTGADDKGANGKNRGSSERKLSNKEVRSRVPNGSLVKSKSDIAAKTRKPPAGSRTTVRKTKFDQEEENRKKIEELRIQRQKRIAKRSVASGANPVTSRRSSTENKTSTISMKSQPLTQDTKKSPKPVLRSSTIERLATARNTSKASSAESKASQPKKSTLKENGSSTTVSQKTAPVEDKKSSSNKVRASDKKSGPNKVLSSDSVAQGKDSKEVTVALPTEPAAPRETQPTDIVDNFKDIQELQSTSIEKTEEKEISQRNTSEDRSSNGNMLTEDKPVQLDHVKGDEELTKASTVVSEDKRAPEDFVEDIPEMTVHPLPPLPVKTVKFATVNIEGNGGMNEKFLSPRISEIEISTPPPNDGMNTEPVHSRKKWNNDETSPKAAKGFRKLLFFGRKNRNSPTY |